Smartlight Interaction System

ABSTRACT

The conference room automation apparatus employs a processor-based integrated movement sensor, lights, cameras, and display device, such as a projector, that senses and interprets human movement within the room to control the projector in response to that movement and that captures events occurring in the room. Preferably packed in a common integrated package, the apparatus employs a layered software/hardware architecture that may be readily extended as a platform to support additional third-party functionality.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 14/070,769 filed on Nov. 4, 2013 entitled SMARTLIGHT INTERACTION SYSTEM, which claims the benefit of U.S. Provisional Application No. 61/723,652, filed on Nov. 7, 2012. The entire disclosure of the above applications are incorporated herein by reference.

FIELD OF THE DISCLOSURE

This disclosure relates generally to conference room automation systems and more particularly to a processor-based integrated movement sensor, lights, cameras and projector that senses and interprets human movement within the room to control the projector in response to that movement and that captures events occurring in the room.

BACKGROUND

This section provides background information related to the present disclosure which is not necessarily prior art.

The conference room in most office environments will typically have a projector equipped with an analog video (VGA) connector, or sometimes a digital video connector, designed to be plugged into a user's laptop computer during the meeting. Control and use of the projector is strictly a manual operation. A user must physically plug his or her laptop into the projector, lower the room lights using conventional wall-mounted controls, point the projector at a suitable wall or projection screen, adjust the tilt angle of the projector, and focus. If another presenter wishes to project information from his or her laptop, the first laptop must be disconnected, the second laptop plugged in. If it is desired to project onto a different surface than originally selected, the projector must by manually moved.

SUMMARY

This section provides a general summary of the disclosure, and is not a comprehensive disclosure of its full scope or all of its features.

The disclosed apparatus, which we refer to herein as the Smartlight interaction system, offers a substantial improvement over the conventional conference room projector. The Smartlight interaction system (also referred to herein as “Smartlight” or “Smartlight System”) integrates lighting, sound, projection, and sensing capabilities to enable new collaboration, communication, and interactivity experiences. Smartlight provides a platform that can be amplified by third-party developed applications and hardware add-ons, and can create a new business ecosystem. Such ecosystem has the potential to provide far more utility and value to consumers.

The Smartlight system provides tight integration sensing, visual projection, audio input output, lighting, wireless computing and robotics in a compact form factor, and gives users new experience with augmented, just-in-time projected information, object scanning, audio I/O, and gestural user interface. It dynamically augments the environment and objects with media and information, with seamless connection with laptops, mobile phones, and other electronic devices. It transforms surfaces and objects into interactive spaces that blend digital media and information with the physical space. Potential application areas include business, education and home.

In one embodiment the Smartlight is adapted for a business meeting room environment. The embodiment offers a new interactive meeting experience with enhanced communication, collaborations and sharing.

For the business environment, the advantage of Smartlight is that it is a compact self-contained module compatible with standard ceiling fixtures and can be seamlessly integrated into meeting rooms, while at same time, it provides the following features in addition to standard lighting;

-   -   Smart projection to any surfaces in the room;     -   Adaptive smart LED lighting;     -   Occupancy sensing, user seating sensing, user identification,         gesture sensing;     -   Document scanning, taking high resolution image of objects and         writing;     -   Directional audio recording and playback;     -   Plug-and-play of users personal computer and mobile devices

The disclosed technology thus provides innovations at three levels:

-   -   Hardware design of customized sensing, projection and lighting;     -   Software design for intelligent interpretation of sensory         inputs, and autonomous control of sensing and audio-visual         outputs;     -   Platform design for customizability, flexibility and creating an         open, collaborative ecosystem.

Therefore, in accordance with one aspect, the Smartlight interaction system comprises an integrated case adapted to be installed in a room or within a defined space. At least one sensor is disposed within the case that detects human movement within the room or space. A display device, such as a projector, is disposed within the case and is responsively coupled to said sensor. A processor coupled to the sensor is programmed to interpret sensed movement and to control the projector in response to human movement within the room or space. Alternatively, a TV or other display devices having some components disposed within the case may be used.

Further areas of applicability will become apparent from the description provided herein. The description and specific examples in this summary are intended for purposes of illustration only and are not intended to limit the scope of the present disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

The drawings described herein are for illustrative purposes only of selected embodiments and not all possible implementations, and are not intended to limit the scope of the present disclosure.

FIG. 1 is a perspective view of the Smartlight apparatus installed in the ceiling of an exemplary conference room.

FIG. 2 is a perspective view of the conference room, with the Smartlight apparatus installed, showing how projection is effected.

FIG. 3a is a side view of a ceiling-suspended projector mounting apparatus;

FIG. 3b is a front view of the ceiling-suspended projector mounting apparatus of FIG. 3 a.

FIGS. 4a, 4b and 4c are exemplary use cases of the Smartlight apparatus.

FIG. 5 is a software architecture diagram illustrating one embodiment by which the processor is programmed to perform the Smartlight apparatus functions disclosed herein.

FIG. 6 is a software and hardware architecture diagram showing how data are passed between different levels of the architecture.

FIG. 7 is a software and hardware architecture diagram showing how control and feedback is passed between different levels of the architecture.

FIG. 8 is a perspective view of a room showing geometric relationships useful in understanding the robotic control algorithms.

FIGS. 9a and 9b are graphical representations useful in understanding the projection and imaging correction algorithms.

FIG. 10 is a graphical representation of the homography matrix used by the projection and imaging correction algorithms.

FIG. 11 is a block diagram illustrating an exemplary Smartlight platform embodiment.

FIG. 12 is a flowchart detailing the manner of user interaction with the Smartlight system.

Corresponding reference numerals indicate corresponding parts throughout the several views of the drawings.

DESCRIPTION OF PREFERRED EMBODIMENTS

Example embodiments will now be described more fully with reference to the accompanying drawings.

Hardware Design

In one embodiment a ceiling-mounting structure is configured to fit within the space of a standard office ceiling tile and is designed to support the system weight and to house control electronics and computer. Shown in FIGS. 1 and 2, an embodiment of the system comprises a pan, tilt and roll robotic base 10 that actuates a payload composed of various sensors and projectors. The payload is connected via cables to a stationary computation unit concealed above the ceiling 12. Also on a stationary support are sensors and actuators that do not need to move with the projector (e.g., speakers 14, LED lights 16, microphones 18). Required connections to the outside world are power and data (wired or wireless). In one embodiment the projection system comprises one or more hybrid laser/LED projectors for digital information projection. The lighting system may be implemented using programmed LED lights with electronically adjustable orientation capability to provide context-aware adaptive lighting.

In one embodiment the audio input output system comprises a pair of 1D microphone arrays. Working together, these arrays can record surround audio and implement 2D beam forming. The speakers are powered using Class-D amplifiers to provide high quality audio information to the room occupants with minimal weight and minimal heat dissipation.

The base 10 also incorporates an optical sensing system comprising one or more electronically controlled still and video cameras 19 for capturing images from within the room. The base also includes at least one pair of depth sensors 21 for room occupancy and interaction sensing as will be described. For example, the still camera can be a high resolution still camera (e.g., 10 mega pixel, 10× optical zoom) selected to have sufficient resolution for on-demand document scanning and image taking.

Although many different pan/tilt and roll robotic base configurations are possible. FIGS. 3a and 3b show one embodiment where the projector 20 with projector lens 22 is suspended from a ceiling-mounted platform using an ensemble of rotary units that permit movement of the projector up-down, panned side-to-side, and if desired tilted to true up the image with the projection surface, such as projection surface 24 in FIG. 2. In other words, the robotic base effects movement in the yaw, pitch and roll dimensions under the control of electronically controlled motors as at 26, 28 and 30.

The camera, depth sensor and the projector are mounted on a close-loop, servo-controlled robotic arm which supports pan/tilt motion. It enables on-demand projection, user interaction sensing and image taking toward any area of the room. A small computer, such as a Mac Mini, serves as the controller of the whole system. It wirelessly communicates with other personal devices in the room and with the Internet to support interaction with personal devices, and integration with cloud services.

As illustrated in FIG. 2, the projector 20 projects an image on surface 24. In this case the projected image includes an image of an analog clock 32 as well as textual information 34 and graphical or video information 36. As will be further explained, the robotic base can rotate the projector so that it projects its image down onto a suitable table surface 38 below, or onto another wall, such as wall 40.

Some of the basic use case scenarios of the robotically controlled projector and associated components are as follows.

Assisting Meeting and Collaboration

Shown in FIG. 4a , the meeting participants are standing at a projection table surface. The projector has been robotically controlled to project an image onto the table surface. Using the embedded depth sensors and cameras, the system identifies participants and tracks their body positions relative to the projection surface. A 3D audio/video meeting recording is captured by the microphones 18 and cameras 19 and this recording is digitally saved for later playback through the projector 20 and speakers 14, if desired. The system is further able to digitally process the captured images and sound (speech) and generate an automatic transcription that can be searchable for archive creation. The cameras may also be used for object and document scanning.

The system can also augment the traditional multi-site videoconference scenario (project interactive documents on table and participants at various positions on the wall, create a multi-site interactive whiteboard with “virtual participants” from other locations, generate private “close-captioning” for each participant depending on their language settings). If desired, the system can effect intervention into the meeting dynamic by suggesting speaker turns and by directing phases and output.

Finally, the system can help with space and time management, by projecting an overlay of the meeting agenda onto the projection surface (such as below the projected clock in FIG. 2 as at 34). This can help with starting and conducting the meeting in an organized and timely fashion. During the meeting the system can project an overlay of time indicators, onto the meeting presentation or via other sensory reminders to show the progress of time allocated for the meeting.

Natural Presentation

Shown in FIG. 4b , the system allows the meeting presenter to convey information naturally, as the system automatically anticipates and tracks the presenter's motion. Thus the system automatically identifies the projection surface(s) of interest based on shape and color attributes, for example. The system automatically identifies presenters and participants, noting head positions, and then prevents light from shining in people's eyes or on an unwanted surface. The presenter uses natural gestures and voice to control the presentation (i.e.; the projector and material being projected therefrom) and all surfaces are rendered interactive by the system monitoring the presenter's gestures as he or she presents. In FIG. 4b , the presenter is shown moving his or her hand in a lateral gesture, which the system interprets as a command to move the display from projection upon wall 40 onto wall 24. This allows the presentation to be moved to any location within the room, based on a user's demand or alternatively based on fully automated commands issued from the system.

If a user wishes to share a document during the meeting, the system facilitates this through wireless user interface (UI) sharing, wireless display sharing and cloud-based content sharing and rendering. Content can be retrieved and displayed automatically once the presenter has been identified by the system.

Augment Digital with Physical, Augment Physical with Digital

Shown in FIG. 4c , the system allows a presenter or user to augment projected documents as well as physical documents. This is accomplished, for example, by using a physical pen to annotate a projected document. For example, if the projection surface is a writeable whiteboard, a dry felt marker can be used to annotate the projected document, as shown in FIG. 4c . Physical documents can also be scanned by the system, using the camera 19 and then digitizing the captured image. These digitized images can then be projected and electronically augmented as discussed above.

The system supports the display and management of multiple layers of recorded documents and writing (annotation), where layers of physical writing may be placed on top. The user can also manipulate digital content with freehand gestures, including using touch gestures on surfaces. The system can also embed voice notes or other hyperlinks in the captured content.

Software Technology

The software system is a key enabler of the Smartlight with intelligent sensing, adaptive augmented projection and user interactivity. The software architecture encapsulates a large number of sensing and controlling technologies, makes multiple components work seamlessly together, and provides an abstract yet powerful interface to the application layer. The architecture employs advanced aggregated sensing and controlling algorithms for variety of input and output devices.

Software Architecture

The software architecture is based on layered design. The benefit of this design is to give the system high portability on different hardware components, greater flexibility on processing algorithms, and allow developers to easily develop powerful applications. From a physical application standpoint, the layered design is shown in FIG. 5. As illustrated there, the processor within the Smartlight operates upon software loaded into the attached memory of the processor. The software includes an operating system; the Windows operating system 90 has been illustrated in FIG. 5 by way of example. Added to the basic core Windows operating system functionality are a set of input/output driver software algorithms 91, to support communication with the various devices, such as the motion sensing device, the high definition (HD) camera, the robotic pan tilt mechanism, the projector, the LED lighting, the audio system (speakers) and the like. Associated with these driver algorithms are a set of drivers 92 that are designed to define the physical interface with the aforementioned devices.

Running on the operating system as an application or set of applications are the functional application algorithms 93 to support vision systems, motion server systems and projection management. The vision system, for example, includes algorithms that allow the system to be calibrated to a particular room environment and then to perform hand gesture recognition, face recognition, and other intelligent scanning operations.

The software architecture illustrated in FIG. 5 is also configured to support networked connections to other systems. This networked connection capability allows the Smartlight system to define a customizable platform which third parties can add to. Thus the architecture includes an application framework layer 94, connected by network 95 that is architecturally subdivided into an interaction server and an interaction manager. The interaction server supports communication with applications 97 running on another system, such as on another server either within the local area network 95 associated with the Smartlight or within the Cloud (Internet based server on an external network 96). The interaction manager supports the interaction functionality whereby applications running on another system are able to use the interaction resources provided by the Smartlight platform. The networked connection between the algorithm layer 93 and the application framework layer 94 may be implemented using a suitable protocol, such as the protobuf protocol. The networked connection between the application framework layer 94 and the application layer 97 may be implemented using the HTTP protocol, and/or webSocket protocol, for example.

From sensing perspective, shown in FIG. 6, the lowest layer 102 is the abstraction of the raw sensors and retrieval of raw sensor data. The next layer 104 is the algorithms that interpret and fuse the info from different sensors. The next layer 106 contains algorithms that generate high level information, e.g., gesture detection. The highest level 108 is the system management and applications/GUI.

From control/feedback perspective, shown in FIG. 7, the highest layer 110 is system management and applications. The next layer 112 is high level control modules that take high level commands from the applications, and use algorithms to generate low level control commands for the next layer, the low level hardware control level 114. The hardware control level 114, in turn, feeds the hardware layer 116. The hardware layer 116 occupies the lowest layer and serves as the interface to the native hardware.

Key Algorithms

The Smartlight is an autonomous projection, sensing and interaction system. This poses great challenge to the software algorithms, because the Smartlight needs to be able to turn to any target location, project onto any surface in the room with projection corrected rendering. The software algorithms enable it to sense user interaction and capture images at any location with high precision.

Robotic Control Algorithms

The robotic control algorithm employs an inverse kinematic algorithm that is applied to solve for motor position by finding the Jacobian and then using Newton's method to solve the kinematics equation. In this way, the robotic control algorithm is able to direct the projection to any point in the 3D space. FIG. 8 illustrates this.

Projection and Imaging Correction Algorithms

Because the projection direction is dynamic and toward any surface in the room, standard projector keystone control is not suitable. The Smartlight thus employs a custom keystoning calibration algorithm and projection correction algorithm for OpenGL rendering. In this regard, FIGS. 9a and 9b show how points in an image in projector space (FIG. 9a ) are translated into points in screen space (FIG. 9b ).

The method requires marking of 4 calibration points on the surface, and calculating a 3×3 homography matrix H by minimizing back-projection error:

${s_{i}\begin{bmatrix} x_{i}^{\prime} \\ y_{i}^{\prime} \\ 1 \end{bmatrix}} \sim {H\begin{bmatrix} x_{i} \\ y_{i} \\ 1 \end{bmatrix}}$ ${\sum\limits_{i}\; \left( {x_{i}^{\prime} - \frac{{h_{11}x_{i}} + {h_{12}y_{i}} + h_{13}}{{h_{31}x_{i}} + {h_{32}y_{i}} + h_{33}}} \right)^{2}} + \left( {y_{i}^{\prime} - \frac{{h_{21}x_{i}} + {h_{22}y_{i}} + h_{23}}{{h_{31}x_{i}} + {h_{32}y_{i}} + h_{33}}} \right)^{2}$

Then, as shown in FIG. 10, a 4×4 OpenGL transformation matrix 119 is derived from this homography matrix. By applying this OpenGL matrix in the rendering pipeline, the algorithm can correct any 2D or 3 rendering.

The direction of each still camera is also dynamic. Similar calibration algorithm is developed, and image warping algorithm using the homography matrix to correct the images.

Intelligent User Detection and Interaction Algorithms

The intelligent user detection and interaction algorithm detects if a user enters the room, which seats are occupied by other users, and also the user's hand gestures. The occupancy detection algorithm is based on the total volume change in depth image. The user seating detection is based on advanced image processing including threshold depth volume change in the seating area, blob finding, and likelihood matching. Hand interaction tracking is based on advanced image processing including background estimation and subtraction, blob finding, and blog tracking.

At the end of this document, see the exemplary source code appendix showing how interaction by gesture is handled by displaying projected “buttons” on any surface. The processor is programmed based on the disclosed source code, causing it to respond to user gestures that mimic the pressing of projected “buttons” on a surface, such as a tabletop surface. This capability would allow, for example, each user seated at a conference room table to check into the meeting and make meeting choice selections, such as choosing which native language the user prefers when text documents are displayed in front of him. Of course, the uses of such virtual buttons are numerous and the above is intended as merely one example.

Directional Audio Recording

The goal of the directional audio recording algorithm is to record 3D surround sound audio, and detect the identity (position) of the speaker. The algorithm combines the data from two 1-D microphone arrays to generate 2D sound direction info.

Platform Technology

As illustrated in FIG. 11, the Smartlight device provides a platform that supports third party applications and accessories, which may be added to provide additional features or enhanced functionality. Thanks to its clean, layered design, new applications and new hardware components can be added to fit a user's need. In this regard, there are three main components of the platform: Smartlight device, the application market, and the accessory market. The application market helps third party developers to develop, contribute and distribute new applications. The accessory market helps third party suppliers and developers to create new components and distribute them.

User Interaction, User Interface, Use Case Scenarios

FIG. 12 provides a block diagram detailing how user interaction is performed. When no one is present in the conference room [step 160] the Smartlight shows information on a screen [step 161] so that it is visible, for example, through the open door. In this way passersby can see what schedule is associated with that room. The Smartlight also manages room environment conditions, such as lights, temperature, security and the like [step 162].

Upon detection of a user entering the room [step 163] the configuration is adapted to take into account that the user has entered [step 164], new information may be displayed on the screen to welcome or instruct the user [step 165] and certain automated tasks may be commenced, such as turning on the conference center system, autodialing a conference meeting number, or the like [step 166].

Typically, users will register with the Smartlight system [step 167] which may be done by a variety of means, such as voice recognition, face recognition, or scanning a 3D object or ID card [step 168]. The user's ID may be displayed on the projected screen [step 169]. If desired, the system can share information with others during the registration process. This can include sharing virtual business cards among participating conference members [step 170]. The entire registration process can proceed in the users chosen natural language [step 171], based on the user profile stored within the Smartlight system. Thus if a participant is a native Japanese speaker, his or her information will be presented in the Japanese language. Other participants may simultaneously use other languages. Each users profile may also be used to load a personal interface for that user [step 172]. These personal interfaces may be projected onto the table in front of where that person is sitting. If the user moves to a different location, the Smartlight tracks the users location [step 173] so that the personal interface may be kept current no matter where the user happens to move.

The Smartlight responds to control commands [step 174] which may be effected by hand gesture, by voice, by keyboard, by pointing device entry, or by touch interface manipulation, for example. These control commands serve as the interface for physical equipment as well as virtual appliances within the conference room [step 175].

Conference rooms can serve various functions, such as giving presentations to an audience [step 176], brainstorming [step 180], using a whiteboard located in the room [step 184], or video conferencing [step 187]. The Smartlight facilitates each of these different uses as illustrated in FIG. 12. Thus presentations may be enhanced using Smartlight by responding to the presenters voice or gesture control [step 177], by producing animation effects of a projected image [step 178] and by optimizing the display surface [step 179]. Such optimization may include adding animation or movement effect to a projected presentation, or by automatically moving the placement of a projected image to avoid the presenter's body. The Smartlight will also adapt to different 3D environments automatically. Thus if objects are moved within the room, the presentation position is automatically adjusted.

Brainstorming is enhanced by the Smartlight by allowing participants to readily scan documents [step 181], to organize virtual documents [step 182] and to provide coaching [step 183] by projecting preprogrammed coaching instructions to help the participants think through all of the pending issues or even to “think outside the box.” Whiteboard use is enhanced by the Smartlight by tracking or linking speech to the images as they are drawn upon the whiteboard [step 185]. The video image of the whiteboard images as they are created is captured along with the speech, and the speech may be digitized and converted into searchable text using a speech recognizer. This facilitates later search of the spoken content of the whiteboard session. Objects may be projected onto the whiteboard, and through gesture interaction, these objects may be modified, by writing annotations on them, erasing annotations previously written, moving or relocating content, or merging previously generated and projected content with content newly added while drawing on the whiteboard [step 186].

Video conferencing is enhanced by the Smartlight in several respects. The Smartlight system can create a virtual attendee [step 188]: an attendee who is not physically present in the room but who is made to appear present by projecting an image of the person into the meeting room space. This virtual attendee can, for example, share documents with other participants [step 189]. Metadata information, including captions and translations may be captured as part of the data captured by the Smartlight system [step 190]. If desired, the Smartlight system can itself be a “participant” in the meeting, generating content seen by the other participants. This added content can be triggered or mediated by the movement (images) or voice commands from the other participants, or by other preprogrammed means [step 191].

User Detection, Registration and Tracking:

Smartlight is able to enable rich interactions, enhance collaborations and improve telepresence. In this regard, one issue with the current conference room experience is lack of context and personal information: who's who? where are my documents? who said what in the meeting? Smartlight addressed these issues by applying sensing and intelligent projection on the meeting table.

When the conference room door opens, Smartlight automatically lights the room and projects a meeting agenda onto the table or the wall. It detects where a user is seated and allows a user to register, using touch interaction and voice commands, or RFID. After the user is identified, personal documents and system controls are displayed close to the user. For example, Smartlight may display the last meeting actions and the user can use a hand gesture to open that document. Smartlight also communicates with user's personal device for identification, to augment a personal device, and let a personal device to augment the Smartlight.

Digitize, Augment, and Collaborate:

Among the key tasks and pain points of the typical business meeting is how to digitizing physical documents or objects, how to digitize and share user's writings, and how to collaborate on digital information.

Smartlight's high resolution imaging capability, multi-surface projection capability and interactivity revolutionizes the collaboration experience. A user places an object or document on the table. Smartlight zooms to the object and takes a high resolution image.

In another aspect, Smartlight displays an image on the table for focused discussion. A user can use hand gestures to move the projection to the wall for high resolution display. The user can then write on the wall to annotate the image. Smartlight tracks who is writing on the wall. A user takes an image of the writing layer and shares it. If desired, a user may erase the writing, write again, or capture the image again. Each of these changes is saved as a separate layer. All layers are recorded. By saving layers in this fashion, users can then modify documents and layers using hand gestures. The full meeting session is recorded as 3D video and 3D audio for future playback.

Teleconference/Video Conference:

Current teleconference systems require a number of devices, which are hard to set up, complex to operate, and require space on the meeting table. As one self-contained ceiling-mounted unit, Smartlight provides all the features of an A/V telecom system. It needs zero configuration and leaves the table clean. With Smartlight a teleconference or video conference proceeds like this.

Users enter the meeting room, and an A/V connection with remote sideshow automatically starts, using the cloud connectivity of Smartlight and the information retrieved from the room booking schedule.

The projector projects video stream on the wall, and the speakers and microphone array serves audio communication. The projector also projects a virtual communication control pad (showing buttons such as ‘mute’, ‘end meeting’, ‘share document’). Any user can use hand gestures to interact with these projected buttons. This eliminates the need of physical controllers.

Detailed User Interaction, User Interface, Use Case Scenarios

With FIG. 12 in mind, the scenarios below try to showcase some of the main features of the developed concept in the setting of a meeting room. Smartlight is capable of:

-   -   Exploiting its situation awareness (who speaks, what is said,         link information to source . . . ) for triggering an action         according to the type of event detected (people entering the         room, meeting starts, registration . . . ),     -   Capturing and digitalizing object to create digital interactive         copies (scanned document can be annotated, saved, and         transferred over the network),     -   Turning any surface into a display,     -   Displaying contextual information when the projector is not         explicitly used,     -   Recording and replaying audio,     -   Projecting on objects to allow for new scenarios such as turning         a simple foam model into an interactive product prototype with a         mock-up interface.

Situation Awareness:

Thanks to its sensors, the Smartlight can detect the presence of a user, register the user (explicitly) or identify the user (transparently), and keep track of his or her activities.

The registration process can be done by any process that identifies the user (speech recognition, face recognition, object . . . ) using biometric discriminative data sets previously recorded in user profiles, or using non-biometric data such as badges (with RF, magnetic, or optical markers) or passwords.

The information is collected directly by the Smartlight when possible or via a wireless accessory if needed (e.g., badge reader). Several identification/authentication methods can be used in complement to one another. One example below shows how one could use a personal device to snap the picture of a QR code containing session information and projected on a surface of the room at the beginning of a meeting. Registration can also be performed semi-automatically by having the Smartlight emit a signal (auditory outside the human audible spectrum, light (visible spectrum or not), or RF) that is localized to the room and captured by sensors on the personal devices. The simplest mechanism would involve people logging in to backend through the network (to identify themselves) and then entering a code displayed by Smartlight to verify their actual presence in the room.

One example below illustrate the use of business cards put down on the table surface to identify users (if authentication is not required). If the identify of all users in the room can be accurately determined, then the system can automatically select the content that should be made accessible during the meeting based on each user's respective access rights.

Users are tracked as they moved around the room using the depth sensor/RGB camera/3D audio capture. The corresponding information is added as a metadata stream to the meeting audio visual recordings and used in real-time to personalize the interactive interfaces projected in the vicinity of each user.

The attendees information can be used to label the audio transcription such as in the example below. In a similar fashion, all artifacts brought to the meeting can be traced back to their original owner/presenter.

Meeting Assistant:

Smartlight can identify the type of meeting from the original meeting invite and from the analysis of the meeting dynamics (activity of the participants, frequency of speaker turns, amount of slides projected, amount of content written, use of telecommunication facilities). This information is in turn used by Smartlight to assist meeting participants by suggesting specific tasks to the participants based on time and meeting progress, i.e.:

-   -   Smartlight keeps the meeting agenda and remaining time for all         to see as part of the projected content     -   When the remaining time is low, Smartlight will visually and         audibly notify the participants     -   If no action item or other essential meeting output has been         captured yet, Smartlight can remind the attendees of the         importance to remedy the issue in the time remaining     -   Smartlight can identify attendees with low participation and         encourage them to take the floor and express their opinion     -   Smartlight can identify when the discussion is running away from         the main topic and try to keep the meeting on track     -   Smartlight can display a small “prompter screen” on the wall         opposite of the presentation wall for the speaker to access         important information (presentation notes, timer, voice volume         in decibels, audience estimated arousal level), since the         attendees will be very likely to glance in that direction

Room Inventory:

Smartlight can track specific assets in the room and prevent objects from being removed from the room, identify misuse & abuses, or suggest help when users seem to experience operation issue with a given device. Thus Smartlight can check that the (whiteboard) walls have been cleared of markings when people exit the room, and that the room has been left in a clean state.

Digitalize and Manipulate the Objects

Using the Smartlight, any object can be digitalized and digital copies made instantly available in the same room or at a distant location. For example, in order for all people seating at a table to look at the same paper document, a digital copy is made by Smartlight and automatically presented to each participant with the correct orientation (and possibly proper language using automatically translation if the preferred reading language of the user is known by the system).

That digital copy may then be manipulated (modified, uploaded to a Cloud service and shared with participants): either through digital interaction (enabled by the Smartlight sensors and hyperlinking to other digital content including audio-visual content), or through physical augmentation (ink, object, picture, post-it) in which case it can be digitized again using Smartlight's capture infrastructure.

Smartlight allows this process to be iterative (leading to the creation of content “layers”) that can be individually retrieved, edited, and re-combined together. Smartlight also allows users to replay the editing process and retrieve intermediary versions of digital or physical edits (in a manner similar to revision control systems in the software development world).

Turn any Surface into a Display

The Smartlight can display on any surface and choose the best surface for each type of information. The depth sensor of Smartlight is used to identify surface types and their respective extents, whereas the RGB camera looks at color hue and uniformity. Smartlight can conduct projection tests to identify potential surface glares and to automatically adjust brightness/color in order to provide the best viewing experience to users (uniform color and brightness responses throughout the area). By default, the projection area is selected to be as planar as possible, though in certain scenarios the projection is made to map onto a selected object and the depth/RGB sensors are actively used to track the target surface in time in order to adapt the projection parameters accordingly. In all cases the depth sensor is used to correct the project perspective and present a rectified image to the users (software keystoning effect).

Surface selection will be impacted by user position as well (to avoid having users look too far on the side). If the projection is 3D, Smartlight will select the best surface and projection parameters to guarantee the best 3D effect to all (by reducing the viewing angle spread between users to the minimum).

Depending on the room configuration, the number and position of users, and the projection area location, Smartlight will automatically adjust the projection brightness and the room ambient light (which can be refined based on the detected participant activity, e.g., if they are taking notes, typing on their laptops, or watching the projected image. Local “reading lights” can be provided to each participant as well by Smartlight and those can adapt to the context, e.g., if the user is reading his laptop screen vs. pulling out a sheet of paper that required more secondary lighting).

Table top projection is an important differentiator of Smartlight in contrast to traditional meeting room projection system. In combination with the depth/RGB sensor enabled interactivity, a wide range of new scenarios are made possible, for example:

-   -   Presenting virtual digital copies of paper documents or other         artifacts that resemble the originals but that can be annotated         physically or digitally. Projection orientation can be corrected         for each participant when presenting separate content for each         attendee     -   Presenting interactive maps, architectural plans, or other         content that lends itself better to a horizontal representation     -   Allowing direct collaborative tangible manipulation of slides or         software code through physical or digital affordances     -   Carrying out training sessions with virtual objects with which         participants can naturally interact

If nobody is in the room, Smartlight displays useful information (next meeting schedule, time and date, weather . . . ). This information can be projected, for instance, on the door/sidelight/wall of the room and is visible from the outside if the door/sidelight/wall are made of a material suitable for back projection (frosted glass, for instance).

Smartlight can generate contextual personal interactive display zones on demand. For instance, presenting an open palm towards the ceiling and realizing a particular gesture could trigger Smartlight to create a button interface directly in your hand to control various functionalities of the room such as the teleconference system. Another example would be create a stack of virtual business cards of the other participants next to yours, a secondary display next to your portable device laying face up on the table, or a notification display on the back of your portable device if it is lying face down. Yet another example would be for the participants to be able to create an automatic closed-captioning display in front of them in the language of their choosing, with the option to seek back in time to review a topic, if needed.

The “display everywhere” feature can be advantageously used to enhance the experience of a presentation by adapting the projection surface to the content that is presented (having bar graphs coming straight out of a user's hands for instance).

Audio-Visual Recording and Replay

Meetings are automatically recorded using all sensor streams available in the system (including the 3D audio, 3D video, and high-resolution still pictures). The recorded content is augmented with higher level metadata resulting from initial analysis & interpretation of the data and from additional sensors (e.g., user information & location, audio transcript, document OCR, object recognition, meeting agenda, business card scans). Lastly, all explicit captured content from the meeting participants is added to the recordings (wall writing, document, or artifact pictures, projected content, voice annotations, meeting minutes and other digital notes).

The global pool of recorded content for each meeting constitutes a “meeting object” that is indexed and archived in a database that can be queried at a later time through a web interface or through the Smartlight system itself. Smartlight can identify the type of meeting from the original meeting invite and from the analysis of the meeting dynamics (activity of the participants, frequency of speaker turns, amount of slides projected, amount of content written, use of telecommunication facilities). This information is used to create a “digest” of the meeting that will allow participants as well as non-participants to quickly review the essential parts of the meeting. The digest contains all relevant bits of each stream stitched together (AV room capture, slides, scans, notes, closed captioning, speaker info). If the digest is played back on the Smartlight system itself, users can chose to “relive” the meeting where content, people, & artifact are projected in 3D video & 3D audio at their original place in the room. Machine translation can be applied at the request of the user on any spoken or written content. The indexing of the content allows searches across the entire database using simple keywords or semantically rich queries (“meetings about project X that included John and where we discussed industrial design”).

Meeting objects for a particular project are automatically clustered together to allow for easy overview & analysis of the content generated during the project meetings as well as the progress of the discussions.

Audio Visual Enhancements:

To improve the sound acquisition, 3D source localization is performed by the microphone array which then can form a beam in the direction of the source in order to focus on a specific speaker. The depth and RGB information from the various sensors are combined to create a 3D textured mesh model that can be visualized in a 3D engine and therefore the point of view can be adjusted at playback.

Give Contextual Information:

Smartlight can add some information on top of an object giving some context information.

Virtualization of Person/Object to Interact with:

Any object can be virtualized and a UI can be proposed to the user (phone, remote controller, light button . . . ). In the context of a teleconference, Smartlight can display the digitalized image of participants.

Table-Top Installation:

The same contraption can be used upside-down on a table and enable most of the same use-cases, apart from the ones requiring projection on the table surface.

Multi-Unit Installation:

Several Smartlights units can be installed in the same room to provide additional sensing and actuation coverage, as well as improving sensing capabilities (higher resolution 3D, better audio positioning). The ability to project on more surfaces at the same time opens the door to new scenarios (e.g., documents on the table, remote participants on one wall, presentation on another).

3D Audio Playback & 3D Projection:

Audio spatialization is especially interesting in the meeting replay scenario mentioned above and in general when trying to place a virtual attendee in a specific position of the physical space.

Virtual Pen/Eraser

If desired, one can define a virtual language for creating digital annotations on the projected content, or using real artifact as “props” that will be identified by the system and trigger a particular action (such as writing, erasing, emailing, capturing on picture).

Wireless Connection with any External Device (Laptop, Tablet, Smart Phone):

When connected to participants' personal devices, Smartlight can use the sensors, displays, and actuators from the personal devices to enhance the interaction, for example:

-   -   Smartlight uses microphones to capture better sound and improve         sound localization     -   Smartlight can use the device buzzer to attract attention from a         particular participant     -   Smartlight can use the device screen to display private or         high-resolution content     -   Smartlight can use the device camera to capture documents or         participants' faces

Other Types of Screens:

While projection screens and white wall surfaces are convenient for most conference room applications, other types of screens may be employed, including broadcasting the display image to a digital or analog monitor located in the room, or by streaming the display image to the screens of personal devices (laptops, tablets, smart phones) of participants in the room.

Combined with Laser Pointer:

If desired, a presenter may choose to use a laser pointer. The Smartlight system can track this laser pointer image and treat certain predefined motions as metadata commands, causing the projection to move or change in the same manner as if a hand gesture had been used as described above. Thus the laser pointer becomes another tool to move, reorganize or modify content projected during the meeting.

The foregoing description of the embodiments has been provided for purposes of illustration and description. It is not intended to be exhaustive or to limit the disclosure. Individual elements or features of a particular embodiment are generally not limited to that particular embodiment, but, where applicable, are interchangeable and can be used in a selected embodiment, even if not specifically shown or described. The same may also be varied in many ways. Such variations are not to be regarded as a departure from the disclosure, and all such modifications are intended to be included within the scope of the disclosure. 

What is claimed is:
 1. A method for operating an audio-visual system comprising: detecting presence of a user within a defined space; identifying said user by sensing said user's characteristic parameter; tracking said user's motion; associating said motion with one command of a plurality of predetermined commands of a computer program; and controlling at least one device according to said command.
 2. The method of claim 1, wherein said user's characteristic parameter is either biometric data or non-biometric data.
 3. The method of claim 2, wherein said biometric data is associated with speech recognition, face recognition or object data of said user.
 4. The method of claim 2, wherein said non-biometric data is associated with RF, magnetic, or optical markers or passwords.
 5. The method of claim 1, wherein said device has displaying interface.
 6. The method of claim 5, wherein said device displays a personal interface associated with said identification.
 7. The method of claim 6, further comprises displaying a personal document associated with said identification.
 8. The method of claim 1, wherein said users motion is hand gesture.
 9. The method of claim 1, further comprises enabling said user to conduct an audio/video conference system with one or more virtual attendees.
 10. The method of claim 9, wherein a setting of said audio/video conference is personalized and said setting is associated with said identification.
 11. An audio-visual operating system for controlling at least one device within a defined space comprising: one or more sensors that detect presence of a user and track said user's motion within the defined space; and a processor coupled to said sensors and programmed to control said device, wherein the processor is programmed to identify said user by sensing said user's characteristic parameter and to associate said motion with one command of a plurality of predetermined commands to control said device.
 12. The audio-visual operating system of claim 11, wherein said user's characteristic parameter is either biometric data or non-biometric data.
 13. The audio-visual operating system of claim 12, wherein said biometric data is associated with speech recognition, face recognition or object data of said user.
 14. The audio-visual operating system of claim 12, wherein said non-biometric data is associated with RF, magnetic, or optical markers or passwords.
 15. The audio-visual operating system of claim 11, wherein said device has a display.
 16. The audio-visual operating system of claim 15, wherein said device displays a personal interface associated with said identification.
 17. The audio-visual operating system of claim 16, wherein said device displays a personal document associated with said identification.
 18. The audio-visual operating system of claim 11, wherein said user's motion is hand gesture.
 19. The audio-visual operating system of claim 11, further comprises an audio video conference system with one or more virtual attendees.
 20. The audio-visual operating system of claim 19, wherein said audio/video conference system is personalized and associated with said identification. 